Does Latent Semantic Analysis Reflect Human Associations?
Authors
Abstract
In the past decade, Latent Semantic Analysis (LSA) has been used in many NLP approaches, sometimes with remarkable success. However, its ability to express semantic relatedness has not yet been systematically investigated. In this work, the semantic similarity measures provided by LSA (based on a term-by-term matrix) are compared with human free associations. Three tasks are performed: (i) correlation with human association norms, (ii) discrimination of associated and unassociated pairs, and (iii) prediction of the first human response. After the results are presented, a closer look is taken at the statistical behavior of the data, and a qualitative (example-based) analysis of the LSA similarity values is given as well.
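As a rough illustration of what an LSA similarity value derived from a term-by-term matrix looks like, the sketch below reduces a small co-occurrence matrix with a truncated SVD and compares terms by cosine. It is not the paper's setup: the terms, context words, counts, the choice of `k`, and the helper names `lsa_term_vectors` and `cosine` are all assumptions made up for the example.

```python
import numpy as np

def lsa_term_vectors(cooc, k):
    """Project the rows of a term-by-term count matrix onto k latent dimensions."""
    u, s, _ = np.linalg.svd(np.asarray(cooc, dtype=float), full_matrices=False)
    # Scale the left singular vectors by the singular values (rows = term vectors).
    return u[:, :k] * s[:k]

def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

# Hypothetical toy counts: rows are target terms, columns are context terms
# (hospital, patient, bakery, toast). These numbers are invented for illustration.
terms = ["doctor", "nurse", "bread", "butter"]
cooc = np.array([
    [9, 7, 0, 1],  # doctor
    [8, 6, 1, 0],  # nurse
    [0, 1, 9, 6],  # bread
    [1, 0, 5, 8],  # butter
])

vectors = lsa_term_vectors(cooc, k=2)
sim_related = cosine(vectors[0], vectors[1])    # doctor vs. nurse
sim_unrelated = cosine(vectors[0], vectors[2])  # doctor vs. bread
print(f"doctor-nurse: {sim_related:.2f}, doctor-bread: {sim_unrelated:.2f}")
```

In an evaluation along the lines of task (ii), similarity values like these would be compared for pairs that human association norms list as associated and pairs they do not.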
Related resources
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are among the most popular tools on the Internet and are widely used by expert and novice users alike. Constructing an adequate query that best represents the user's information need is an important concern for web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
Evaluating automatic medical concepts associations with human judgments
This paper evaluates the associations of a subset of concepts extracted by VCGS (Vocabulary Cluster Generating System), a concept extraction and association tool, based on 6000 titles and abstracts downloaded from EBSCOhost Health Source – Consumer Edition database, against associations decided by 30 participants. The results show that after incorporating LSA (Latent Semantic Analysis) techniqu...
Structurally Enhanced Latent Semantic Analysis for Video Object Retrieval
The work presented in this paper aims at reducing the semantic gap between low level video features and semantic video contents. The proposed method for finding associations between segmented frame region characteristics relies on the strength of Latent Semantic Analysis (LSA). Our previous experiments [1], using color histograms and Gabor features, have rapidly shown the potential of this appr...
Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics
This paper discusses the future of World Wide Web development, known as the Semantic Web. Undoubtedly, the Web is one of the most important services on the Internet, and it has had the greatest impact on the spread of the Internet in human societies. Internet penetration has been an effective factor in the growth of the volume of information on the Web. The massive growth of informat...
Using Latent Semantic Analysis to Estimate Similarity
In three studies we investigated whether LSA cosine values estimate human similarity ratings of word pairs. In study 1 we found that LSA can distinguish between highly similar and dissimilar matches to a target word, but that it does not reliably distinguish between highly similar and less similar matches. In study 2 we showed that, using an expanded item set, the correlation between LSA rating...